APPLICATIONS OF STEIN'S METHOD FOR CONCENTRATION 

INEQUALITIES 



SOURAV CHATTERJEE AND PARTE A S. DEY 



Abstract. Stein's method for concentration inequalities was introduced to prove concen- 
tration of measure in problems involving complex dependencies such as random permuta- 
tions and Gibbs measures. In this paper, we provide some extensions of the theory and 
three applications: (1) We obtain a concentration inequality for the magnetization in the 
Curie- Weiss model at critical temperature (where it obeys a non-standard normalization and 
super-Gaussian concentration). (2) We derive exact large deviation asymptotics for the num- 
ber of triangles in the Erdos-Renyi random graph G{n,p) when p > 0.31. Similar results 
are derived also for general subgraph counts. (3) We obtain some interesting concentration 
inequalities for the Ising model on lattices that hold at all temperatures. 



1. Introduction 

In his seminal 1972 paper [36] , Charles Stein introduced a method for proving central limit 
theorems with convergence rates for sums of dependent random variables. This has now come 
to be known as Stein's method. The technique is primarily used for proving distributional 
limit theorems (both Gaussian and non-Gaussian). Stein's attempts [37] at devising a version 
of the method for large deviations did not prove fruitful. Some progress for sums of dependent 
random variables was made by Raic [35| . The problem was finally solved in full generality 
in [10] . A selection of results and examples from f[o\ appeared in the later papers [lT||12| . 
In this paper we extend the theory and work out three further examples. The paper is fully 
self-contained. 

The sections are organized as follows. In Section [2] we state the main results, the examples, 
and some proof sketches. The complete proofs are in Section [3j 

2. Results and examples 
The following abstract theorem is quoted from [11]. It summarizes a collection of results 



from 10 . This is a generalization of Stein's method of exchangeable pairs to the realm of 



concentration inequalities and large deviations. 



Theorem 1 ([ll], Theorem 1.5). Let X he a separable metric space and suppose {X,X') is an 
exchangeable pair of X -valued random variables. Suppose / : A' — t- R and F : X x X ^J^. are 
square-integrable functions such that F is antisymmetric (i.e. F{X,X') = —F{X',X) a.s.), 
and E{F{X,X') \ X) = f{X) a.s. Let 

A(X) ■.= \E{\{f{X)- f{X'))F{X,X')\\X). 
Then E(/(X)) = 0, and the following concentration results hold for f{X): 
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(i) // E(A(X)) < oo, then Var(/(X)) = i E((/(X) - f{X'))F{X, X')). 
(a) Assume that 'E(e^-^^'^'^\F(X, X')\) < oo for all 9. If there exists nonnegative constants B 
and C such that A(X) < Bf{X) + C almost surely, then for any t >0, 



t" \ . ^.„._ . / t" 



P{/W>t}<exp(^-^^^^J and P{/(X) < -t} < exp ^ 

[Hi) For any positive integer k, we have the following exchangeable pairs version of the 
Burkholder-Davis- Gundy inequality: 

WiX f) < - 1)'^E(A(X)^). 

Note that the finiteness of the exponential moment for all ensures that the tail bounds 
hold for all t. If it is finite only in a neighborhood of zero, the tail bounds will hold for t less 
than a threshold. 

One of the contributions of the present paper is the following generalization of the above 
result for non-Gaussian tail behavior. We apply it to obtain a concentration inequality with 
the correct tail behavior in the Curie- Weiss model at criticality. 

Theorem 2. Suppose {X, X') is an exchangeable pair of random variables. Let F{X, X'), f{X) 
and A(X) be as in Theorem^ Suppose that we have 

A(X) < ipifiX)) almost surely 

for some nonnegative symmetric function tp on IR. Assume that ip is nondecreasing and twice 
continuously differ entiable in (0, oo) with 

a : = supxV''(a;)/'i/'(a^) < 2 (1) 

and 5 : = sup 3;^/>"(2;)/V'(x) < oo. (2) 
x>0 



P(|/(X)| >t)< cexp 



Assume that E(|/(X)|^) < oo for all positive integer k > 1. Then for any t > we have 

2i,{t)) 

for some constant c depending only on a, 5. Moreover, if tp is only once differentiate with 
a < 2 as in then the tail inequality holds with exponent t^/4'0(t). 

An immediate corrolary of Theorem [2] is the following. 

Corollary 3. Suppose {X, X') is an exchangeable pair of random variables. Let F{X, X'), f{X) 
and A(X) be as in Theorem^ Suppose that for some real number a G (0, 2) we have 

A{X) < B + C almost surely 

where B > 0,C > are constants. Assume that E(|/(X)|'^) < oo for all positive integer 
k > 1. Then for any t > we have 

P(|/(X)|>t)<c„exp(-i.^^ 

for some constant Ca depending only on a. 

The result in Theorem [2] states that the tail behavior of f{X) is essentially given by the 
behavior of /(X)2/A(X). Condition ^ implies that < + x^) for ah x G E. 

Moreover, the constant Cq, appearing in Theorem [2] can be written down explicitly but we did 
not attempt to optimize the constant. The proof of Theorem [2] is along the same lines as 



STEIN'S METHOD FOR CONCENTRATION INEQUALITIES 



3 



Theorem [T| but somewhat more involved. Deferring the proof to Section |3] let us move on to 
examples. 

2.1. Example: Curie- Weiss model at criticality. The 'Curie- Weiss model of ferromag- 
netic interaction' at inverse temperature (3 and zero external field is given by the following 
Gibbs measure on {+1, —1}". For a typical configuration a = ((7i,(T2, . . . ,0"^) G {+1, — 1}" 
the probability of cr is given by 




where = Zj3(n) is the normalizing constant. It is well known that the Curie- Weiss model 
shows a phase transition at /3c = 1. For /? < /3c the magnetization m((T) := ^ Yli^=i "^j 
concentrated at but for /3 > /3c the magnetization is concentrated on the set {—x*,x*} 
where x* > is the largest solution of the equation x = tanh(/3j;). In fact using concentration 
inequalities for exchangeable pairs it was proved in ^lOj (Proposition 1.3) that for all (3 > 
0, /i G R, n > 1, t > we have 

/ t \ f f 

P |m - tanh(/3m + h)\>- + —= \ < 2 exp ' 



n - ^ V 4(1 + /3)^ 

where h is the external field, which is zero in our case. Although a lot is known about this 
model (see Ellis [19] Section IV. 4 for a survey), the above result - to the best of our knowledge 
- is the first rigorously proven concentration inequality that holds at all temperatures. (See 
also [l4] for some related results.) 

Incidentally, the above result shows that when /3 < 1, the magnetization is at most of 
order n~^/^. It is known that at the critical temperature the magnetization m(cr) shows 
a non Gaussian behavior and is of order n~^/^. In fact, at /3 = 1 as n — )• oo, n^/^m(<T) 
converges to the probability distribution on R having density proportional to exp(— 1^/12). 
This limit theorem was first proved by Simon and Griffiths [23] and error bounds were obtained 
recently |13[|16| . The following concentration inequality, derived using Theorem [2| fills the 
gap in the tail bound at the critical point. 

Proposition 4. Suppose a is drawn from the Curie- Weiss model at the critical temperature 
/3 = 1. Then, for any n > 1 and t > the magnetization satisfies 

P(ni/V('^)l >t)< 2e-"*' 
where c> is an absolute constant. 

Here we may remark that such a concentration inequality probably cannot be obtained by 
application of standard off-the-shelf results (e.g. those surveyed in Ledoux the famous 
results of Talagrand [38] or the recent breakthroughs of Boucheron, Lugosi and Massart 
joj), because they generally give Gaussian or exponential tail bounds. There are several 
recent remarkable results giving tail bounds different from exponential and Gaussian. The 
papers [l4 20 , 29 deal with tails between exponential and Gaussian and [l}[4] deal with sub- 
exponential tails. Also in [5,21 ,22 the authors deal with tails (possibly) larger than Gaussian. 
However, it seems that none of the techniques given in these references would lead to the result 
of Proposition |4j 



It is possible to derive a similar tail bound using the asymptotic results of Martin-Lof 31 
about the partition function Zp{n) (see also Bolthausen fl\). An application of their results 



4 



SOURAV CHATTERJEE AND PARTHA S. DEY 



gives that 



VI -120. 



(Te{-i,+i}' 

for < 1/12 in the sense that the ratio of the two sides converges to one as n goes to 
infinity and from here the tail bound follows easily (without an explicit constant). However 
this approach depends on a precise estimate of the partition function (for example, large 
deviation estimates or finding the limiting free energy limn~^ log Zf^{n) are not enough) and 
this precise estimate is hard to prove. Our method, on the other hand, depends only on simple 
properties of the Gibbs measure and is not tied specifically to the Curie- Weiss model. 

The idea used in the proof of Proposition [4] can be used to prove a tail inequality that holds 
for all < /? < 1. We state the result below without proof. Note that the inequality gives the 
correct tail bound for all < /3 < 1. 

Proposition 5. Suppose a is drawn from the Curie-Weiss model at inverse temperature f3 
where < /3 < 1. Then, for any n > 1 and t > the magnetization satisfies 

P(3(l - /3)m(cr)2 + l3^m{cT)^ > t) < 2e-"*/i6°. 

It is possible to derive similar non-Gaussian tail inequalities for general Curie- Weiss models 
at the critical temperature. We briefiy discuss the general case below. Let p be a symmetric 
probability measure on R with J dp{x) = 1 and J exp(/3x^/2) dp{x) < oo for all /3 > 0. 
The general Curie- Weiss model CW(p) at inverse temperature (3 is defined as the array of 
spin random variables X = {Xi,X2, ■ ■ ■ , Xn) with joint distribution 

dun{x) = exp f ^ (xi + X2 H h x„)^ j JJdp(xi) (3) 

for X = (xi, X2, . . . , x„) G R'" where 

= y exp^^(xi + X2H h x„)^^ ]^dp(xi) 

is the normalizing constant. The magnetization m(x) is defined as usual by m(x) = X^ILi 
Here we will consider the case when p satisfies the following two conditions: 

(A) p has compact support, that is, /?([— L, L]) = 1 for some L < oo. 

(B) The equation /i'(s) = has a unique root at s = where 

h{s) := — — log / exp(sx) dp{x) for s G R. 

The second condition says that h{-) has a unique global minima at s = and |/i'(s)| > for 
|s| > 0. The behavior of this model is quite similar to the classical Curie- Weiss model and 
there is a phase transition at /5 = 1. For (3 < 1, m(X) is concentrated around zero while for 
/3 > l,m(X) is bounded away from zero a.s. (see Ellis and Newman [T7[|18| ). We will prove 
the following concentration result. 

Proposition 6. Suppose X ~ m„ at the critical temperature /3 = 1 where p satisfies condition 
(A) and (B). Let k be such that /i(*)(0) = for < i < 2k and /i(2fc)(0) / 0, where 

,2 



■5 f 

^(^) '■— — -^OS / exp(sx) dp{x) for s G R 
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and /i*-*) is the i-th derivative ofh. Then, k > 1 and for any n > 1 and t >0 the magnetization 
satisfies 

P(ni/2fc|m(X)| >t)< 26-''^' 

where c > is an absolute constant depending only on p. 

Here we mention that in Elhs and Newman ^17j, convergence results were proved for the 
magnetization in CW{p) model under optimal condition on p. Under our assumption their 
result says that n^^'^^m(X.) converges weakly to a distribution having density proportional to 
exp(— Ax^'^/(2A:)!) where A := h^'^''\0). Hence the tail bound gives the correct convergence 
rate. 

Let us now give a brief sketch of the proof of Proposition |4j Suppose cr is drawn from the 
Curie- Weiss model at the critical temperature. We construct cr' by taking one step in the 
heat-bath Glauber dynamics: A coordinate I is chosen uniformly at random, and aj is replace 
by a'j drawn from the conditional distribution of the I-th coordinate given {aj : j ^ I}- Let 

n 

F{(T, cr') := ^(cji - cj-) = aj - a'j. 

i=l 

For each i = 1,2, ... ,n, define rui = mi{cr) = ^H^i computaion gives that 

E((Ti|{(Tj, j / i}) = tanh(mi) for all i and so we have 



^ n 1 

/(cr) := E(F(cr,cr')|cr) = m y^tanh(mi) = — - ^ girui 



n — ' n n 

j=l i=l 

where g{x) := x — tanh(x). Note that |mj— m| < 1/n, and hence /(cr) = m — tanhm-|-0(l/n). 
A simple analytical argument using the fact that, for x w 0, x — tanhx = -|- O(x^) then 
gives 

A(.)< ^1/(^)1^/3 + ^ 
and using Corollary [s] with a = 2/3, i? = 6/n and C = we have 

P(|m - tanhml > t + n"^) < P(|/(cr)| > t) < 26"'="**^" 

for all t > for some constant c > 0. It is easy to see that this implies the result. The critical 
observation, of course, is that x — tanh(/3x) = 0{x^) for j3 = 1, which is not true for /3 7^ 1. 

2.2. Example: Triangles in Erdos-Renyi graphs. Consider the Erdos-Renyi random 
graph model G{n,p) which is defined as follows. The vertex set is [n] := {1,2, . . . ,n} and 
each edge (i, j), 1 < i < j < n is present with probability p and not present with probability 
1 — p independently of each other. For any three distinct vertex i < j < k m. [n] we say that 
the triple {i,j, k) forms a triangle in the graph G{n,p) if all the three edges (i, j), (j, k), (i, k) 
are present in G{n,p) (see figure [T]). Let T„ be the number of triangles in G{n,p), that is 

Tn := l{(i, j, k) forms a triangle in G{n,p)}. 

l<i<j<k<n 

Let us define the function /(•, •) on (0, 1) x (0, 1) as 

r 1 — r 

/(r, s) := r log - -I- (1 — r) log . (4) 

s 1 — s 
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Figure 1. A graph on 6 vertices with 8 edges and 3 triangles. The triangles 
being (1,2,3), (1,3,4) and (1,3,6). 



Note that /(r, s) is the Kullback-Leibler divergence of the measure Vg from I'r and also the 
relative entropy of Vr w.r.t. i^s where Up is the Bernoulli(p) measure. We have the following 
result about the large deviation rate function for the number of triangles in G{n,p). 

Theorem 7. Let Tn be the number of triangles in G{n,p), where p > po where po = 2/(2 + 
e^/^) ~ 0.31. Then for any r G (p, 1], 

ri\ ,\ / n'^I{r,p) 



P T„, > 



exp 



3 J ) --\ 2 + (5) 

Moreover, even if p < po, there exist p' ^p" such that p < p' < p" < I and the same result 
holds for all r € {p,p') U {p" , 1]. For all p and r in the above domains, we also have the more 
precise estimate 

n^I{r,p) 



P 



Tn 



n 



< C{p,r)n^/^ 



exp 



-(l + 0(n-^/2)) 



(6) 



where C{p,r) is a constant depending on p and r. 




Figure 2. The set (colored in gray) of {p,r),r > p for which we are able to 
show that the large deviation result holds. 

The behavior of the upper tail of subgraph counts in G(n, p) is a problem of great inter- 
est in the theory of random graphs (see (6| [25|[27||28[|39] , and references contained therein). 
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The best upper bounds to date were obtained by Kim and Vu f28l (triangles) and Janson, 
Oleszkiewicz, and Rucihski ^26j (general subgraph counts). For triangles, the results of these 
papers essentially state that for a fixed e > 0, 

exp(-e(nVlog(l/p))) < P(r„ > E(T„) + enV) < exp(-e(nV))- 

Clearly, our result gives a lot more in the situations where it works (see Figure [2]). The method 
of proof can be easily extended to prove similar results for general subgraph counts and are 
discussed in Subsection 2.3 However, there is an obvious incompleteness in Theorem [7] (and 



also for general subgraphs counts), namely, that it does not work for all {p,r). 

In this context, we should mention that another paper on large deviations for subgraph 
counts by Bolthausen, Comets and Dembo fs] is in preparation. As of now, to the best of 
our knowledge, the authors of [Sj have only looked at subgraphs that do not complete loops. 



like 2-stars. Another related article is the one by Doring and Eichelsbacher 15 , who obtain 
moderate deviations for a class of graph-related objects, including triangles. 

Unlike the previous two examples, Theorem [7] is far from being a direct consequence of any 
of our abstract results. Therefore, let us give a sketch of the proof, which involves a new idea. 

The first step is standard: consider tilted measures. However, the appropriate tilted mea- 
sure in this case leads to what is known as an 'exponential random graph', a little studied 
object in the rigorous literature. Exponential random graphs have become popular in the 
statistical physics and network communities in recent years (see the survey of Park and New- 
man [33]). The only rigorous work we are aware of is the recent paper of Bhamidi et. al. [s], 
who look at convergence rates of Markov chains that generate such graphs. 

We will not go into the general definition or properties of exponential random graphs. Let 
us only define the model we need for our purpose. 

Fix two numbers /3 > and /i G R. Let ft = {0, Ij^a) be the space of all tuples like 
X = {xij)i<i^j<n, where Xij G {0, 1} for each Let X = (Xjj)i<j<j<„ be a random element 
of fl following the probability measure proportional to e^^^\ where H is the Hamiltonian 

H(x) — ^ ^ XijX ji^x^i^ -\- h ^ ^ Xij. 

l<i<j<k<n ^<i<j<n 

Note that any element of Q naturally defines an undirected graph on a set of n vertices. For 
each X G il, let T(x) = X]i<j<fc ^ij^jk^ik denote the number of triangles in the graph defined 
by X, and let E{x.) = X]i<j ^ij denote the number of edges. Then the above Hamiltonian is 
nothing but 

^ + /.E(x). 

n 

For notational convenience we will assume that Xij = xji. Let Zn{/3, h) be the corresponding 
partition function, that is 

Z„(/3,/^) = ^e^W. 

Note that /? = corresponds to the Erdos-Renyi random graph with p = e^/{l + e^). The 
following theorem 'solves' this model in a 'high temperature region'. Once this solution is 
known, the computation of the large deviation rate function is just one step away. 

Theorem 8 (Free energy in high temperature regime). Suppose we have /3 > 0, /i G R, and 

Zn{f3, h) defined as above. Define a function ip : [0, 1] — )• R as 
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Suppose P and h are such that the equation u = f{u) has a unique solution u* in [0, 1] and 
2ip{u*)ip' (u*) < 1. Then 



lim 2 = -n^viu ), v?(0)) - - log(l - ip{0)) + ^ , 

where /(•,•) is i/ie function defined in Q. Moreover, there exists a constant K{l3,h) that 
depends only on /3 and h (and not on n) such that difference between logZ„(/3, h) and the 
limit is bounded by K{I3, h)n~^/'^ for all n. 

Incidentally, the above solution was obtained using physical heuristics by Park and New- 
man |34| in 2005. Here we mention that, in fact, the following result is always true. 



Lemma 9. For any (3 > 0,h €z we have 

liminfi5^^4^ > sup |-kr,V^(0))-Jlog(l-c^(0)) + ^^ (7) 
re{0,l) I ^ ^ o 

sup <! -Imu), ^(0)) - 1 iog(i - + 



u:ip(u)'^=u 



2 '"^^ " 2 °' 6 



We will characterize the set of /3, h for which the conditions in Theorem [8] hold in Lemma 12 
First of all, note that the appearance of the function (p{u)'^ — -u is not magical. For each i < j, 
define 

Lij = — XikXjk- 

k^{i,j} 

This is the number of 'wedges' or 2-stars in the graph that have the edge ij as base. The key 
idea is to use Theorem [T] to show that these quantities approximately satisfy the following set 
of 'mean field equations': 

v{Lik)f{Ljk) for all i < j. (8) 

(The idea of using Theorem [T] to prove mean field equations was initially developed in Sec- 
tion 3.4 of [lO].) The following lemma makes this notion precise. Later, we will show that 
under the conditions of Theorem [8| this system has a unique solution. 

Lemma 10 (Mean field equations). Let ip be defined as in Theorem^ Then for any 1 < i < 
j < n, we have 



P 



i^/nLij-^ ^ ip{Lik)(p{Lj 



^jk 

for all t > 8f3/n. In particular we have 

E 



>t\ < 2 exp 



t2 



Lij ip{Lik)tp{Ljk) 



where C is a universal constant. 



In fact, one would expect that Lij ~ u* for all i < j, if the equation 

V'(n) := ^{uf -u = (10) 

has a unique solution u* in [0, 1]. The intuition behind is as follows. Define Lmax = maxjj Lij 
and Ljnin = niinjjLjj. It is easy to see that ip is an increasing function. Hence from the 
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mean-field equations ([s]) we have Lmax < v(-^^max)^ + o(l) or '(/'(-^max) > o(l). But ip{u) > 
iff u < n*. Hence Lmax < u* + o(l). Similarly we have -Lmin > u* — o(l) and thus all Lij ~ u* . 
Lemma 11 formalizes this idea. Here we mention that one can easily check that equation 
(10) has at most three solutions. Moreover, ip{0) > > implies that ip'{u*) < or 

2ip{u*)ip' (u*) < 1 if n* is the unique solution to (10). 

Lemma 11. Letu* be the unique solution of the equation u = ip{u)'^ . Assume that2ip{u*)(p' (u*) < 
1. Then for each 1 < i < j < n, we have 

E \U, -u\< 

where K{l3,h) is a constant depending only on I3,h. Moreover, if 2{p{u*)(p' (u*) = 1 then we 
have 

E iLj,- - u*| < for all 1< i < j < n. 

Now observe that the Hamiltonian H(X.) can be written as 

H(X.) = ^ ^ ^ijLij + h ^ Xij. 



The idea then is the following: once we know that the conclusion of Lemma 11 holds, each 
Lij in the above Hamiltonian can be replaced by u* , which results in a model where the 
coordinates are independent. The resulting probability measure is presumably quite different 
from the original measure, but somehow the partition functions remain comparable. 

The following lemma (Lemma 12) characterizes the region 5 G R x [0,oo) such that the 
equation u = ip{u)'^ has a unique solution u* in [0, 1] and 2ip{u*)ip' (u*) < 1 for (/i,/3) G S (see 
figure |3]) . 

Let Hq = log 2 — I < 0. For h < ho there exist exactly two solutions < a^, = a^{h) < 
1/2 < a* = a*{h) < oo to the equation 

l + x 

logx H — h n = 0. 

Ix 

Define a.t{h) = a*{h) = 1/2 for h = Hq and 

AW = ^ii^and^-W=(l±^ (11) 



for h < Hq. 



Lemma 12 (Characterization of high temperature regime). Let S be the set of pairs {h,/3) 
for which the function 'ip{u) := ^{u)'^ — u has a unique root u* in [0, 1] and 2ip{u*)(p' {u*) < 1 
where ip{u) := e^"+''/(l + 6^^"+^). Then we have 

S"^ = {{h,l3) ■.h<ho and /3*(/i) < /3 < I3*{h)} 

where j3* ,j3^, are as given in equation (11). In particular, {h, (3) £ S if 13 < (3/2)^ or h > Hq. 

Remark. The point h = ho, 13 = f3o := (3/2)'^ is the critical point and the curve 

,(*)=(- log* -_,L_^j (12) 

for t > is the phase transition curve. It corresponds to ^{u*) = and 2ip{u*)tp' {u*) = 1. In 
fact, at the critical point (/io,/3o) the function V'('u) = V'('u)^ ~ ^ has a unique root of order 
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Figure 3. The set S (colored in gray) of {h,(3) for which the conditions of 
Theorem [8] hold. 



three at u* = 4/9, i.e., ^{u*) = ^'{u*) = ^"{u*) = and 'il)"'{u*) < 0. The second part of 



lemma 11 shows that all the above conclusions (including the limiting free energy result) are 



true for the critical point but with an error rate of n Define the "energy" function 

e(r) = h{r, 99(0)) + ^ log(l - <^(0)) - ^ 

appearing in of the r.h.s. of equation ([T]). The "high temperature" regime corresponds to 
the case when e(-) has a unique minima and no local maxima or saddle point. The critical 
point corresponds to the case when e(-) has a non-quadratic global minima. The boundary 
corresponds to the case when e(-) has a unique minima and a saddle point. In the "low 
temperature" regime e(-) has two local minima. In fact, one can easily check that there is a 
one dimensional curve inside the set S'^, starting from the critical point, on which e(-) has two 
global minima and outside one global minima. Below we provide the solution on the boundary 
curve. Unfortunately, as of now, we don't have a rigorous solution in the "low temperature" 
regime. 

For (h, (3) on the phase transition boundary curve (excluding the critical point) the function 
■)/'(•) has two roots and one of them, say u*, is an inflection point. Let u* be the other root. 
Here we mention that u* is a minima of e(-) while v* is a saddle point of e(-). On the lower 
part of the boundary, which corresponds to {7(t) : t < 1/2}, the inflection point v* = (l + t)^^ 
is larger than u* , while on the upper part of the boundary corresponding to {jit) : t > 1/2}, 
the inflection point v* = {l + t)~'^ is smaller than u*. The following lemma "solves" the model 



at the boundary point j{t) (see eqn. 12). 



Lemma 13. Let ^{■),u*,v* be as above and {h,/3) = j{t) for some t 7^ 1/2. Then, for each 
^ 1^ i < j 1^ n, we have 

E(|L.,-n*|)<=^^ (13) 



n 



for some constant K(/3, h) depending on f3, h. Moreover, we have 

logZ„(/3,/i) 1 1 P^{u*f , _i/2. 
= -i^Ii^yu ), </'(0)) - - log(l - (/j(0)) + + 0{n ' ) 
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and 



P 



TrAY) 



exp 



n 



ip{u 



*\3 



3 



(l + 0(n-i/2))), 



(14) 



where Y = ((5^j))i<j follows G{n,ip{0)) and the constant appearing in O(-) and C{f3,h) 
depend only on I3,h. 

In the next subsection we will briefly discuss about the results for general subgraph counts 
that can be proved using similar ideas. 

2.3. Example: General subgraph counts. Let F = {V{F), E{F)) be a fixed finite graph 
on vp := |y(F)| many vertices with ep := \E(F)\ many edges. Without loss of generality 
we will assume that V{F) = [vp] ■= {1,2, . . . ,vf}. Let ap = |Aut(F)| be the number of 
graph automorphism of the graph F. Let Nn be the number of copies of F, not necessarily 
induced, in the Erdos-Renyi random graph G{n,p) (so the number of 2-stars in a triangle 
will be three). We have the following result about the large deviation rate function for the 
random variable Nn- 

Theorem 14. Let Nn be the number of copies of F in G{n,p), where 

Bp — 1 



p> Po-'- 



ep — 1 + exp 



e.p — 1 



Then for any r £ (p, 1], 



P Nn > 



vpl ( n 



exp 



n'^I{r,p) 



(l+0(n-V2)) . 



(15) 



ap \Vpy 

Moreover, even if p < po, there exist p',p" such that p < p' < p" < I and the same result 
holds for all r £ {p,p') U {p" , 1]. For all p and r in the above domains, we also have the more 
precise estimate 



P 



N„. 



Vpl ( n 



ap \vp 



< C7(p,r)n^^-i/2 
n'^I{r,p) 



exp 



(l + 0(n-V2))l 



where C{p,r) is a constant depending on p and r. 

Note that pq as a function oi ep is increasing and converges to 1 as number of edges goes 
to infinity (see Figure [4]). So there is an obvious gap in the large deviation result, namely the 
proof does not work when r > p, p < po and the gap becomes larger as the number of edges 
in F increases. Note that po ^ 1 as ep 



oo. 



The proof of Theorem 14 uses the same arguments that were used in the triangle case. 
Here the tilted measure leads to an exponential random graph model where the Hamiltonian 
depends on number of copies of F in the random graph. Let /3 > 0, /i G IR be two fixed 
numbers. As before we will identify elements of Q := {0, Ij^a) with undirected graphs on a 
set of n vertices. For each x G $7, let A'^(x) denote the number of copies of F in the graph 



defined by x, and let E{x) = ^ 



i<j ^ij 



denote the number of edges. Let X = {X, 



be 
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a random element of following the probability measure proportional to e^^^\ where H is 
the Hamiltonian 

i/(x) = ^ iV(x) + hE{^). 

where {n)m = (^n-my. • R-^call that vp is the number of vertices in the graph F. The scaling 
was done to make the two summands comparable. Also we used (re — 2)^^-2 instead of n"^ 
to make calculations simpler. Let Zn{/3,h) be the partition function. Note that A^(x) can be 
written as 

iV(x) = ^ E n -Utr (16) 

l<ti,t2,-,Up<n,(i,j)eEiF) 
U^tj for ijtj 

For X G < i < j < n, define xj^^.^ as the element of which is same as x in every 
coordinate except for the (i,j)-th coordinate where the value is 1. Similarly define jy For 
i < j, define the random variable 

^ ._ iV(Xy)-iV(XO,)) 
■ {n- 2)„^_2 

The main idea is as in the triangle case. We show that Lj^'s satisfy a system of "mean- field 
equations" similar to Q which has a unique solution under the condition of Theorem 15 In 
fact, we will show that i^ij" ~ "u* for all i < j and £'(X)" ~ " {^^^if(u*) under the condition 
of Theorem [HI Now note that we can write the hamiltonian as 

i/(X) = — ^ ^ij^ij + h ^ Xij 

which is approximately equal to h*E{'X.) where h* = h + f3u* /ep. Now the remaining is a 
calculus exercise. 

So the first step in proving the large deviation bound is the following theorem, which 
gives the limiting free energy in the "high temperature" regime. Note the similarity with the 
triangle case. 

Theorem 15. Suppose we have /? > 0, /i G IR, and Zn{l3, h) defined as above. Define a 
function : [0, 1] — )• K, as 
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Suppose j3 and h are such that the equation apu = 2epip{u)^^^^ has a unique solution u* in 
[0, 1] and 2eF{ep - l)(^(n*)''^" < ap. Then 

n-s>oo 2 2 ap 

where /(•,•) is the function defined in Q. Moreover, there exists a constant K{(3,h) that 
depends only on /3 and h ( and not on n) such that difference between n^^ log Zn{/3, h) and the 
limit is bounded by K{P, h)n~^/'^ for all n. 



Here also we can identify the region where the conditions in Theorem 15 hold. Let 

/lo = log(ep - 1) - (17) 
ep — 1 

For h < ho there exist exactly two solutions < a* = o=i,(/i) < 1/2 < a* = a*{h) < oo of the 
equation 

1 + X 

log X + -. — + h = 

[Bp — 1)X 

Define a^,{h) = a*{h) = l/{ep — 1) for h = ho and 

m) = ^^4±^ and = ^^4±^ (18) 

2ep{ep-l)a^ 2ep{ep-l)a* ^ ' 

for h < hQ. 

Lemma 16. Let S be the set of pairs {h,P) for which the function 

ip{u) := 2epip{u)^^^^ — apu 

has a unique root u* in [0, 1] and 2ep{ep — l)(p{u*)^''~'^ip'{u*) < ap where (p{u) := e^^~^^/{l + 
^I3u+hy xhen we have 

= {{h,l3) ■.h<ho and /3,(/i) < (3 < P*{h)} 



where Hq, /3* , /3^: are as given in equations (17), (18). In particular, (/i,/3) £ S if 

ep-l 

ape/ 
^^ 2(e/l)e. 



In fact Lemma 16 identifies the critical point and the phase transition curve where the 
model goes from ordered phase to a disordered phase. But the results above does not say 
what happens at the boundary or in the low temperature regime. However note that the 
mean-field equations hold for all values of (3 and h. 

2.4. Example: Ising model on Z'^. Fix any (3 > 0,h £ H and an integer d > 1. Also 
fix n > 2. Let B = {1, 2, . . . ,n + 1}"^ be a hypercube with (n + 1)*^ many points in the 
d-dimensional hypercube lattice Z*^. Let Q be the graph obtained from B by identifying the 
opposite boundary points, i.e., for x = {xi,X2, ■ ■ ■ , Xii),y = {yi,y2, ■ ■ ■ , yd) £ B we have x is 
identified with y if Xi — yi £ {—n, 0, n} for all i. This identification is known in the literature 
as periodic boundary condition. Note that 0, is the d-dimensional lattice torus with linear 
size n. We will write x ^ y for x,y £ \i x,y are nearest neighbors in $7. Also let us denote 
by Nx the set of nearest neighbors of x in 0, i.e., = {y G 17 : y ~ x}. 

Now consider the Gibbs measure on {+1,-1}^ given by the following Hamiltonian 

H{a) := j3 ^ axCTy + h'^ax 
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where <t = {ax)xen is a typical element of {+1, —1}^. So the probability of a configuration 
a G {+1,-1}^ is 

IJ'fi,hiW}) ■= exp {H{a)) = Z^j^ exp /3 ^ a^cTy + /i ^ j (19) 

where Zp^h = X^o-ef+i -i}" s^^*^^ is the normalizing constant. Here Gx is the spin of the 
magnetic particle at position x in the discrete torus VL. This is the famous Ising model of 
ferromagnetism on the box B with periodic boundary condition at inverse temperature /3 and 
external field h. 

The one-dimensional Ising model is probably the first statistical model of ferromagnetism to 
be proposed or analyzed ^24]. The model exhibits no phase transition in one dimension. But 
for dimensions two and above the Ising ferromagnet undergoes a transition from an ordered 
to a disordered phase as (3 crosses a critical value. The two dimensional Ising model with 
no external field was first solved by Lars Onsager in a ground breaking paper ^32j, who also 
calculated the critical /3 as /3c = sinh~^(l). For dimensions three and above the model is yet 
to be solved, and indeed, very few rigorous results are known. 

In this subsection, we present some concentration inequalities for the Ising model that 
hold for all values of (5. These 'temperature-free' relations are analogous to the mean field 
equations that we obtained for subgraph counts earlier. 

The magnetization of the system, as a function of the configuration cr, is defined as m{a) : = 
PI Sxeo -^^'^ each integer k G {1, 2, . . . , 2d}, define a degree k polynomial function rfc(cr) 
of a spin configuration a as follows: 

r,H:=(f/)|l^|)"E E -S (20) 

where as = na;eS'^^ S" C 0. In particular r^^o-) is the average of the product of 

spins of all possible k out of 2d neighbors. Note that ri[a) = m{cr). We will show that when 
/i = and n is large, m[a) and rfc(cr)'s satisfy the following "mean-field relation" with high 
probability under the Gibbs measure: 



(1 - ^o(/3))m(,T) « 0mr2k+i{cr). (21) 



k=l 

These relations hold for all values of /? > 0. Here ^^'s are explicit rational functions of tanh(2/3) 



for A; = 0, 1, . . . , (i — 1, defined in equation (22) below. (Later we will prove in Proposition 19 



that an external magnetic field h will add an extra linear term in the above relation (21).) 
The following Proposition makes this notion precise in terms of finite sample tail bound. It 
is a simple consequence of Theorem [TJ 

Theorem 17. Suppose a is drawn from the Gibbs measure ^i-pfl- Then, for any f3 > 0,n > 1 
and t > we have 



d-l 



(1 - 0o(/3))m(o-) - 5^^fc(/3)r2fe+i(cr) 



k=l 



>t \ < 2 exp 



46(/3) 
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where m{a) := J^'^^^QO^x is the magnetization, rf^{a) is as given in (20) and for k = 
0,l,...,d-l 

^ ^ cre{-l,+l}2d \ i=l / j=l 

d-1 

and b{(3) = |1 - + ^(2A; + iMm. (22) 

k=l 

Moreover, we can explicitly write down 9o{(3) as 

W) = ^i:^C + J tanh(2fc/3) 
k=i ^ ^ 

and for d > 2 there exists /3i G (0,oo), depending on d, such that 1 — 6o{P) > for (3 < f3i 
and 1 - eo{f3) < for 13 > pi. 

Here we may remark that for any fixed k, 9k{f3/2d) converges to the coefficient of x"^^^^ in 
the power series expansion of tanh(/3x) and 2d/3i{d) J, 1 as d — )■ oo. For small values of d we 
can explicitly calculate the ^^'s. For instance, in d = 2, 

Ooi/S) = ^ (tanh(4/3) + 2tanh(2/3)) , 6*1 = ^ (tanh(4/3) - 2tanh(2/3)) . 

For d = 3, 

= — (tanh(6/3) + 4tanh(4/3) + 5tanh(2/3)) , 
16 

eUp) = — (tanh(6/3) - 3tanh(2/3)) , 
16 

e2{(3) = — (tanh(6/3) - 4tanh(4/3) + 5tanh(2/3)) . 
16 

6»o(/3) = — (tanh(8/3) + 6tanh(6/3) + 14tanh(4/?) + 14tanh(2/3)) , 
16 

7 

6»i(/3) = — (tanh(8/3) + 2tanh(6/3) - 2tanh(4/3) - 6tanh(2/3)) , 
16 

7 

6»2(/3) = — (tanh(8/3) - 2tanh(6/3) - 2tanh(4/3) + 6tanh(2/3)) , 

6»3(/3) = — (tanh(8/3) - 6tanh(6/3) + 14tanh(4/3) - 14tanh(2/3)) . 
16 

Corollary 18. For the Ising model on at inverse temperature (3 with no external magnetic 
field for all t > we have, 

(i) tfd = l, 

P(|m(cr)| >t)< 2exp ( - tanh(2/3))t2 



For d = 4, 



(ii) tfd = 2, 

( \^\t^ 

P(|[(l -u)2 -n^]m(cr) +uV3(cr)| > t) < 2exp 



32 
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where u = tanh(2/3) and T^[a) = ^jpy 'YT '^x^yO'z where the sum ^* is over all x,y,z ^ 
such that \x — y\ = 2, \z — y\ = 2, |x — z| = 2. 
(iii) ifd = 3, 

Fi\g{u)m{a) + 5u^ {I + u'^y^icr) - 3ttV5(cr)| > t) < 2exp (-cjOlt^) 

where c is an absolute constant, g{u) = 1 — 3u + — 9m'^ + 3n'* — 3u^, u = tanh(2/3) 
and ra , r^ are as defined in ( 20 ) . 

Although we do not yet know the significance of the above relations, it seems somewhat 
striking that they are not affected by phase transitions. The exponential tail bounds show 
that many such relations can hold simultaneously. For completeness, we state below the 
corresponding result for nonzero external field. 

Proposition 19. Suppose cr is drawn from the Gihhs measure fi/^^h- Let ri.{cr), 9k{P), b{f3) 
be as in proposition (17). Then, for any /3>0, /iGR, n>l and t >0 we have 

P (1(1 - eomm{a) - g{a)\ >t)< 2exp f- J^'!l^ , ) (23) 

where 



and 



46(/3)(l + tanh|/i|) 
d-l 

g{a) := ^ ^fc(/3)r2fc+i(cr) + tanh(/i) ( 1 - ^fc(/3)s2fc+i(o- 

k=l \ k=0 

^ ^ ^ ^ -Y-cO Cr^ AT Id — i. 



d-l / d-l 



xenscN:,,\s\=k 

is the average of products of spins over all k-stars for k = 1,2, . . . ,2d and Q is the discrete 
torus in TL'^ with many points. 

3. Proofs 

3.1. Proof of Proposition |4[ Instead of proving Theorem [2] first, let us see how it is applied 
to prove the result for the Curie- Weiss model at critical temperature. The proof is simply an 



elaboration of the sketch given at the end of Subsection 2.1 



Suppose <T is drawn from the Curie- Weiss model at critical temperature. We construct cr' 
by taking one step in the heat-bath Glauber dynamics: A coordinate / is chosen uniformly at 
random, and o"/ is replace by a'^ drawn from the conditional distribution of the /-th coordinate 
given {oj : j I}. Let 

n 

F{a, a') := ^(cJi - cr-) = <t/ - a'j. 

i=l 

For each i = 1,2, ... ,n, define rrii = mi{cr) = Yji^i computaion gives that 

E((Tj|{(Tj, j 7^ i}) = tanh(mj) for all i and so we have 



^ n 1 " 

/(cr) := E(F(cr, (T'')\a-') = m } tanh(mi) = 1 — > ginii 



n — ' n n 



where g{x) := x — tanh(x). By definition mi{cr) — m{a) = Oijn and mi{cr') — m{a) = 
{ai + aj — (y'j)/n for all i. Hence using Taylor expansion upto first degree and noting that 
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= tanh^(x) < we have 

\f{cT) - f{cr')\ < -\g'{micr))\ + ^ ^' 

< -m{cTy + ^. 

Clearly |F(cr,cr')| < 2. Thus we have 

A{a) := I E[\f{cT) - " IH^, 't')! I ^] < -m{a? + \. 

I n 

Now it is easy to verify that \x\^ < 5|x — tanhx| for all |x| < 1. Note that this is the place 
where we need /3 = 1. For /3 7^ 1, the linear term dominates in m — tanh(/3m). Hence it 
follows that 



m 



(cr)2 < 52/V(cr) - tanhm(cr)|2/3 < 3|/(cr)|2/3 + 3^- 



2/3 



where in the last line we used the fact that |/(cr) — (m — tanhm)| < 1/n and 5^/'^ < 3. Thus 



and using Corollary [s] with a = 2/3, i? = 6/n and C = 12/n^/^ we have 

P(|m - tanhml > t + n'^) < P(|/(cr)| > t) < 26"'="**^" 
for all t > for some constant c > 0. This clearly implies that 

P(|m| >t)< P(|m - tanhml > t^/S) < 26"''''** 
for all t > and for some absolute constant c > 0. Thus we are done. 

3.2. Proof of Proposition [6j The proof is along the lines of proof of proposition [4j Suppose 
X is drawn from the distribution We construct X' as follows: a coordinate / is chosen 
uniformly at random, and Xi is replace by X'^ drawn from the conditional distribution of the 
/-th coordinate given {Xj : j ^ I}. Let 



F(X,X') := ^{X, - XI) =Xi- X\. 



For each i = 1,2, ...,n, define mj(X) = n ^ "^j^iXj. An easy computaion gives that 
E{Xi\{Xj, j / i}) = g{mi) for alH = 1, 2, . . . , n where g{s) = ^(log /exp(x^/2n + sx) dp{x)) 
for s G P. So we have 

1 " 

/(X) := E(F(X,X')|X) = m(X) - - V<7(m,(X)). 

n ^-^ 

i=l 

Define the function 

h{s) = — — log / exp(s2;) dp{x) for s G P. (24) 

Clearly h is an even function. Recall that k is an integer such that h^^"^ (0) = for < i < 2k 
and /i(2fc)(0) / 0. We have k>2 since h"{0) = 1 - J x"^ dp{x) = 0. 
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Now using the fact that p{[—L,L\) = 1 it is easy to see that |/(X) — h'{m{%))\ < c/n for 
some constant c depending on L only. In the subsequent calculations c will always denote a 
constant depending only on L that may vary from line to line. Similarly we have 



|/(X)-/(X')|< 



\Xi-X'j\ 



n 



l-5'(m(X))| + 



c(l + SUP|^|<i|fi("(x)|) 



n 



<^|/i"Mx))| + 4. 



Note that |/i"(s)| < cs^^""^ for some constant c for all s > 0. This follows since lims_j.o h"{s) / s^^^"^ 
exists and h"{-) is a bounded function. Also \\ms^f)\h' {s)\/\s\'^^~^ = \h'^'^^\Q)\ / and 
|/i'(s)| > for s > 0. So we have > clsp^'^"*^ for some constant c > and all |s| < L. 

From the above results we deduce that 



|/(X) - /(X')| < -|(m(X))|^^-^ + ^ < -|/i'(m(X))|^ + ^ 

Now the rest of the proof follows exactly as for the classical Curie- Weiss model. 
3.3. Proof of Theorem [7[ First, let us state and prove a simple technical lemma. 
Lemma 20. Let xi, . . . , x^, yi, . . . ,yk be real numbers. Then 



max 

l<i<n 



Eft 'y ■ \ ^ ft r} I ■ 



and 



log J]] e^' - log J]] e^" 



i=l 

Proof. Fix 1 < i < /c. For t G [0, 1], let 

h{t)-- 

Then 



i=l 



< 2 max \xi 

l<i<n 



< max \xi 

l<i<k 



,txi+{l-t)yi 



h'{t) 



[Xi - Vi) 



txj+{l—t)yj 



h{t). 



This shows that < 2maxj |xj — yi\ for all t G [0, 1] and completes the proof of the first 

assertion. The second inequality is proved similarly. □ 



Proof of Lemma 10, Fix two numbers 1 < i < j < n. Given a configuration X, construct 
another configuration X' as follows. Choose a point k G {1, . . . , n}\{i,j} uniformly at random, 
and replace the pair (Xj^, Xji^) with (X^'^, Xj^) drawn from the conditional distribution given 
the rest of the edges. Let L'- be the revised value of Ljj. From the form of the Hamiltonian 
it is now easy to read off that for x,y G {0, 1}, 

F{Xi, = x,X'^, = y\X) 

(X exp( PxLik + PyLjk + hx + hy - -xXijXjk - -yX^Xik + -xyXij] . 
\ n n n J 
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An application of Lemma 20 shows that the terms having /3/n as coefficient can be 'ignored' 



in the sense that for each x,y G {0, 1}, 
P(X,', = x,Xj, = y|X) 



In particular, 



Now, 



(1 + eP^^k+h'^{i _|_ el^Ljk+h-^ 
2/3 



< 



2/3 



n 



\E{Xl,X',\X)-^iUkML,k)\< 



n 



(25) 



E{Lij - L'ij I X) = _ {XikXjk - E{X[kX'jk I X)) 

lh\fh Zi j 



n-2 



~ n{n- 2) ^ |X). 



(26) 



Let F(X,X') = (n - 2)(Lij - L^ -) and /(X) = E(F(X,X') | X). Let 



From ( 25 ) and ( 26 ) it follows that 



|/(X)-5(X)|< 



2/3 



n 



(27) 



Since X' has the same distribution as X, the same bound holds for \f{X') — g[X')\ as well. 
Now clearly, \F{X,X')\ < 1. Again, \g{X) - giX')\ < 2/n, and therefore 

4(1 + /3) 



\f{X)-fiX')\< 



n 



Combining everything, and applying Theorem [T] with B = Q and C = 2(1 + li)/n^ we get 



P(|/(X)| >t) < 2exp 



4(1 + /3) 



for all t > 0. From (27) it follows that 



P(b(X)| > t) < P(|/(X)| > t - 2/3/n) < 2exp 



8(1 + /3) 

for all t > 8/3/n. This completes the proof of the tail bound. The bound on the mean absolute 
value is an easy consequence of the tail bound. □ 



Proof of Lemma 11 , The proof is in two steps. In the first step we will get an error bound of 
order n~^^^^/^ogn. In the second step we will improve it to Define 



A = max 

1< j<jr<n 



^"■^ ~ n V'(-^^ifc)'/'(-^jfc! 



By Lemma 10 and union bound we have 

P (A > t) < exp 
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for all t > 8/3/n. Intuitively the above equation says that A is of the order of y^log n/n, in 
fact we have E(A^) = 0(logn/n). Clearly ip is an increasing function. Hence we have 

f{LmmY ~ ^ ^ ^min ^ ^max ^ '/'(-^max)^ + ^ 

where Lmax = maxi<i<j<„ Lij and Lmin = mini<i<j<„ Lij. 

Now assume that there exists a unique solution u* of the equation <f{u)'^ = u with 
2ip{u*)(p' (u*) < 1. For ease of notations, define the function 'tp{u) = ^{u)"^ — u. We have 
V'(O) > > "0(1)) u* is the unique solution to il){u) = and il^'{u*) < 0. It is easy to see 
that il^'{u) = has at most three solution {iIj'{u) = 2/3(/?(it)^(l — (p{u)) — 1 is a third degree 
polynomial in <p{u) and is a strictly increasing function). 

Hence there exist positive real numbers e, 6 such that > £ \u — u*\ > 6. Note that 

ip{u) > if u < u* and ^{u) < is m > n*. Decreasing e, 5 without loss of generality we can 
assume that 



inf 

0<\u-u*\<5 



U — U 

-i){u) 



This is possible because < 0. Note that tp{Liji 

have 

U* - 6 < Lniin < -Z^max < U* + 6 



c> 0. (28) 
x) > —A and ip{Lmm) < A. Thus we 



when A < e. Using (28), u* < -Lmax < u* + 6 implies that |-Lmax — u*\ ^ cA and u* — 5 < 
Lmin ^ u* implies that iLmin — w*! < cA. Thus, when A < e, we have |Ljnax — u*\ < cA and 
l-^min — u*\ < cA and in particular, \Lij — u*\ < cA for all i < j. So we can bound the 
distance of L^j from u* by 



u 



< c^E(A^) + P(A >e)< K{f3,h)- 



n 



for all i < j- 

Now let us move to the second step. Recall from Q that 

C(l + /3)i/2 

n 



E 



n 



1/2 



(29) 



for all i < j. Let -Djj = Ljj — u*. Using Taylor expansion around u* upto degree one we have 

ip{Lik)ipiLjk) - ^{u*f = ip{u*){f{Lik) - ^(u*)) + ip{u*){^p{Ljk) - ^{u*)) 

= {u"){Dik + Djk) + Rijk 

where E{\Rijk\) < CW.{Df-) < CrT^ logn for some constant C depending only on /3, h. Thus 



E 



^^^~n ^ 'PiLik)v{Ljk) - Dij + 



2u* 

< + 

n n 



n 



- ^ '^iRijk 



< 



Clogn 



n 



(30) 



Here we used the fact that u* = ip{u*)'^. Combining (29) and (30) we have 



E 



n 



< 



c 



n 
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21 



EIL, 



u 



1 



1 - 2ip{u*)ip'{u*) ^ 



C _ K{(3,h) 



n 



where h) is a constant depending on /3, h. 

When 'iIj{u) = has a unique solution at u = u* with 2'ip{u*)^l:' {u*) = 1, which happens at 
the critical point /3 = (3/2)^, h = log 2 — 3/2, instead of equation (28) we have 



inf 

0<|n-n*|<<5 



{u - U*) 



c> 



since 'iIj{u*) = ip'{u*) = ^"{u*) = and %l)"'{u*) < 0. Then using a similar idea as above one 
can easily show that 

E\Lij - u*\ < K{(3,h)n-'^/^ 
for some constant K depending on f3, h. This completes the proof of the Lemma. 



Remark. The proof becomes lot easier if we have 

c:=(^ l • sup ^^}r^<:^- 

o<x<i \x-u*\ 2 

This is because, by the triangle inequality we have 



□ 



(31) 



\Lij - n*| < ^ Ljj - - ^ (p{Lik)(p{Ljk] 

i<j i<j k<^{i,j} 



U + 



2n* 



. n 



n 



(32) 



Now recall that condition (31) says that ip{l)\(p{x) — ip{u*)\ < c\x — u*\ for all x £ [0,1] 
Moreover Lij £ [0, 1] for all i, j, and u* = ip{u*)'^. Thus, 

\ip{Lik)f{Ljk) -u*\< c\Lik - u*\ + c\Ljk - u*\. 
Combining everything we get 



u* < 



i<j 



1 - 2c 



Taking expectation on both sides, and applying Lemma \T0\ we get 

C(l + /3)n3/2 



i<j 



1 - 2c 



And this gives the required result. In fact using basic calculus results one can easily check 
that condition (31) is satisfied when /i > or /3 < 2. 

Now we will prove that in the exponential random graph model, the number of edges and 
number of triangles also satisfy certain 'mean-field' relations. 

Lemma 21. Recall that E(x.) and T(x) denote the number of edges and number of triangles 
in the graph defined by the edge configuration x G 17. If^is drawn from the Gibbs' measure 
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in Theorem^ we have the hound 



E 



E 



T(X) 1^ 

Kj 



< C(l + /3)^/2^ 



where and C is a universal constant. 
Proof. It is not difficult to see that 

E(Xjj I {Xki)(k,i)^{i,j)) = ^(Lij). 

Let us create X' by choosing I < i < j < n uniformly at random and replacing Xij with 
X'- drawn from the conditional distribution of Xij given {Xki)[k,i)^(ij)- Let F(X,X') = 
(2)(^^.-^.',)- Then 

/(X) = E(F(X,XO|X) = Y,{Xui - ^{Lki)) = E{X) -J^viLki). 

k<l k<l 

Now |F(X,X')| < (2) and |/(X) - /(X')| < 1 + /3. Here we used the fact that \ip'{x)\ < /3/4. 
Combining the above result and Theorem [l] with B = 0,C = ^(l + /?) (2) > we get the required 
bound. 

Similarly, if we define F(X,X') = {^{XijLij — X^jLij). Then 

/(X) = E(F(X,X')|X) = Y^i^kiLki - ^{Lki)Lki) 

k<l 

3 

TpCj - ^if{Lki)Lki. 

k<l 



n 



Again, |F(X,X')| < (2) and |/(X) - /(X')| < C(l + (3). The bound follows easily as before. 

□ 



The following result is an easy corollary of Lemma [TT] and Lemma 21 



Corollary 22. Suppose the conditions of Theorem^ are satisfied. Then we have 



E 



n 



where C is a constant depending only on f3,h. 



Lemma 23. Suppose the conditions of Theorem |8] are satisfied. Let Tn be the number of 
triangles in the Erdos-Renyi graph G{n,ip{0)). Then there is a constant K{l3,h) depending 
only on (3 and h such that for all n 



logP(|r„ - QXn*)^! < i^(/3,/i)n5/2) -I{^{u*),^{Q)) 



< 



K{P,h) 



n 



Proof. Let X be drawn from the Gibbs' measure in Theorem [8] with parameters /3,/i. From 
corollary 22 we see that there exists a constant K{/3, h) such that (for all n) 



P 



E{X) 



< K{l3,h)n^/A > 
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and 



Now let 



P 



T{X) n^^{u*) 



*\3 



n 



6 



A= <^xG {0,ir : 



T{x) n^if{u*f 



n 



and 



5 = An <^ X G {0,1}" : 



E{x) 



6 



< K{/3, h)n^''^ 



Now suppose Y = (5^j)i<i<j<n is a collection of i.i.d. random variables satisfying P(lij = 1) = 
1 — P(lij = 0) = 93(0) and Z = {Zij)Ki^j<n is another collection of i.i.d. random variables 
with T{Zij = 1) = 1 — P{Zij = 0) = ^p{u*). Without loss of generality we can assume 
that K{f3,h) was chosen large enough to ensure that (again, for all n) P(Z £ A) > 1/2 and 
P(Z £ B) > 1/2. Now, it follows directly from the definition of A and Lemma 20 that 

log Y: e'""^^^ - log Y: e"^^'^^^^ + ^if^ 



logJ^e'^^W+^^^^-logJ^e . 
T{x) n'^ip{u*f 



< /3max 
xeA 



n 



6 



'-^+hE{x) 



(33) 



Next, observe that 



log J2 e^+'^^(-) - log Y e^+'^^(-) 
= |logP(X eA)\< |log(3/4)|. 



iST(x) 



(34) 



Similarly we have 



log Y e^+''^(^) - log Y e^^'^^^^) 

x£B x£fl 

= |logP(XGi?)|<|log(l/2)| 



0T(x) 



(35) 



where we used the fact that P{X £ AnC) >F{X £ A) + F{X £ C) - I. Combining the last 
two inequalities, we get 



log^e^+^^(^)-log5]e^ 



0T(x) 



-hE(x) 



xgA 



x&B 



< log(8/3). 



(36) 



Next, note that by the definition of B and Lemma 20, we have that for any h' , 

m^+hE(x) n'^jh-h'Mu*) l3n^ip{u*Y 
2 6 



log^e n 

xeB 

(3T{x) 



log^e'^'^(-) 

xeB 



< sup 

xeB 



n 



6 



< {l3 + \h-h'\)K{^,h)n^/^. 



(37) 
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Now choose h' = log -, "^^^ L . Then 



log^e'^'^(^)-log^e'''^(^) 



logP(Z eB)\< log 2. 



Adding up ( p3| ) , p6| ) , ( |37| ) , and ( 38 ) , and using the triangle inequality, we get 



log 5^ 



logj^e^'^^^' 



(38) 



(39) 



x£A xeQ 

where K'{/3, h) is a constant depending only on /3, /i. For any s G R, a trivial verification 
shows that 



log ^ e 



s£;(x) 



log(l + e^). 



Again, note that logP(y G A) = log^^jg^ QhE{x) — log^^g^ ^hE(x) ^ Therefore it follows from 
inequality (39) that 

K'iP,h) 



logP(y G A) {h - h')ip{u*) +\og{l + e''') - log{l + e^) 



< 



n 



Now h = log and h' = log ^^^J) . Also, log(l + e^) = - log(l - v?(0)) and log(l + e'^') 

— log(l — (p{u*)). Substituting these in the above expression, we get 



logP(yGA) -/((^(n*),(^(0)) 



< 



K'{P,h) 



n 



This completes the proof of the Lemma. 



□ 



We are now ready to finish the proof of Theorem [8) 



Proof of Theorem^ Note that by adding the terms in (35), (37), and (38) from the proof of 
Lemma l23l and applying the triangle inequality, we get 



log Zn{P,h) {h-h'Mu) !3^{uf 1 



log(l + e 



This can be rewritten as 



\0gZn{P,h) /((^(U),(^(0))+log(l-(^(0)) ^ifjuf 

2 ^ 2 6 



< 



< 



K{l3,h) 



n 



K{(3,h) 



n 



□ 



This completes the proof of Theorem [8j 

Note that the proof of Theorem |8] contains a proof for the lower bound in the general case. 
We provide the proof below for completeness. 

Proof of Lemma^ Fix any r G (0, 1). Define the set Br as 

r(x) n^r^ 



Br = <x£{0, 1}" : 



n 



6 



E{x) 



2 

n r 



< K{r)n^/'^ 



< K{r)n^/'^, 

where K{r) is chosen in such a way that P(Z G Bj.) > 1/2 where Z = ({Zij))i^j and Z^'s are 

sy to see that 

/3r3 



i.i.d. Bernoulli(r). From the proof of Lemma 23 it is easy to see that 

.2 



log 

xeBr 



e " 



-hE{x) 



~2 



(^{h-h')r + 



+ logfl + 
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where h' = log and K' is a constant depending on (3, h, r. Simplifying we have 

AlogZ„(/3,/.)> Aiog^e^+^^(^) 

xeBr 

Br^ K' 
>'^ + \og{l-p)-I{r,p)--= (40) 



for all r where p = / (1 + e^). Now taking limit as n — t- cxd and maximizing over r we have 
the first inequality Q. Given /?, /i, define the function 

f{r) = — + log(l -p)- I{r, p) 

where p = / (1 + e^) . One can easily check that f'{r) = iff f{u)'^ — u = for u = r'^. From 
this fact the second equality follows. □ 

Lemma 24. Let Tn he the number of triangles in the Erdos-Renyi graph G{n,{p(0)). Then 
there is a constant K{l3, h) depending only on (3 and h such that for all n 

~ 2 ^/n 

Proof. By Markov's inequality, we have 

logP(r„ > Qv{u*f) ^ _A('^^^(^*)3 + lE(e/3'^"/") 



Prom the last part of Theorem [8j it is easy to obtain an optimal upper bound of the second 
term on the right hand side, which finishes the proof of the Lemma. □ 



Proof of Theorem^ Given p and r, if for all r' belonging to a small neighborhood of r there 
exist /3 and h satisfying the conditions of Theorem [s] such that (/^(O) = p and ^{u*) = r', 
then a combination of Lemma 23 and Lemma [24] implies the conclusion of Theorem [7] If 



P > Po = 2/(2 + e^/^), we can just choose /i > /iq = - log 2 - 3/2 such that p = e^/{l + e^) 



and conclude, from Theorem [8} Lemma 23 and Lemma 12, that the large deviations limit 
holds for any /? > 0. Varying /3 between and oo, it is possible to get for any r >p a, (3 such 
that 9?(u*) = r. 

For p < po, we again choose h such that 99(0) = p. Note that h < ho. The large deviations 
limit should hold for any r > p for which there exists /3 > such that r = '^{u*) = ^/v^ and 
(/i, (3) G S. It is not difficult to verify that given h, u* is a continuously increasing function of 



/3 in the regime for which {h, /3) G S. Recall the settings of Lemma 12, Thus, the values of r 
that is allowed is in the set {p,p*)U {p* , 1], where p* ,p* are the unique non-touching solutions 
to the equations 

^ fpl 



This completes the proof of Theorem [7j □ 



Finally, let us round up by proving Lemma 12 



Proof of Lemma 12, Fix /i G R. Define the function 

-0(3;; /i, /3) := (/9(x; /i, /3)^ — x 
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where 



P) = ^ ^ ^p^^^ for X G [0, 1] . 



For simplicity, we will omit /3, h in (p{x; f3, h) and ip{x; 13, h) when there is no chance of confu- 
sion. Note that ■0(0) > > tp{l)- Hence the equation ip{x; f3,h) = has at least one solution. 
Also we have ^p'{x) = 2/?(^(x)^(l — ^{x)) — 1 and f is strictly increasing. Hence the equation 
ip'{x) = has at most three solutions. So either the function ■!/; is strictly decreasing or there 
exist two numbers < a < 6 < 1 such that ip is strictly decreasing in [0, a] U [6, 1] and strictly 
increasing in [a, b]. From the above observations it is easy to see that the equation tp{x) = 
has at most three solutions for any /3, h. If ^{x) = has exactly two solutions then ip' = at 
one of the solution. 

Let u^: = u^:{h, /3) and u* = u*{h,f3) be the smallest and largest solutions of ^{x;h,(3) = 
respectively. If = u* we have a unique solution of iIj{x) = 0. From the fact that 
■^'ip{x;h,f3) > for all x £ [0, 1], /3 > 0, /i G M. we can deduce that given h, u^.{h,f3) and 
u*{h, j3) are increasing functions of (3. Note that is left continuous and u* is right continuous 
in (3 given h. Also note that given /i S H, u* = if /3 > is very small or very large. So we can 
define /3* {h) and (3* (h) such that for (3 < f3^ (h) and for /? > /3* (/i) we have {h,/3) = u* {h,(3). 
/3* is the largest and f3* is the smallest such number. 

Therefore, we can deduce that at (3 = /3*{h), {3* (h) the equation '4'{x; h, (3) = has exactly 
two solutions. Thus we have two real numbers x*, x* G [0, 1] such that 

ip{xf = X and 2/3(^(x)^(l - ip{x)) = 1 

for (x,/3) = (x*,/3*) or {x*,(3*). Thus we have 2f3x{l — ^/x) = 1 and 

/i = log 



2(1 - V^) 

for x = x*,x*. Define a* = x^^^^ - 1 and a* = (x*)"^/^ - 1. Note that x = (1 + a)~'^,l3 = 
(1 + of /2a? for (x, a, f3) = {x^:,a^:, (3^:) or {x*,a*,f3*) and we have 

1 + a / N 

h = - loga - —- 41 
2a 

for a = a^:,a*. Now the function g{x) = — logx — (1 + x)/2x is strictly increasing for x G 
(0, 1/2] and strictly decreasing for x > 1/2. So equation (|41[) has no solution for h > g{l/2) = 



log 2 — 3/2 =: /iQ. For h < Hq equation (41 ) has exactly two solutions and for h = ho equation 



(41) has one solution. One can easily check that /3* < f3* implies that a* < a*. Also from the 



fact that (41) has at most two solutions, we have that for (3 G (/3*,/3*) the equation iIj{u) = 



has exactly three solutions. □ 



3.4. Proof of Lemma |13| , For simplicity we will prove the result only for the lower boundary 
part, that is, for (h, 13) = 'y{t) with i < 1/2. The proof for the upper boundary is similar. 
Fix t < 1/2. Let us briefly recall the setup. The function tp{u) = (p{u)'^ — u has two roots at 
< n* < < 1 and il>'{u^) < while ip'iv*) = 0,1/;" (v*) < 0. 
Define the function 

(3r^ 

fi''^) — — I" fog(i ~ p) ~ i{f-,p) foi^ £ (0, 1). 

o 
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Figure 5. The function -(/'(•) for {h, f3) = 7(1/4). 



From the proof of Lemma [o] and the fact that ip'{u) < for u G {u*,v*) it is easy to see that 
fi^iu*)) > fi^iv*)) and 



logZniP,h)>f{ipiu*)) 



K 



(42) 



where K depends on /3, h. Now, using the same idea used in the proof of Lemma 11 , we have 



P (A > t) < exp 



8(1 + /3) 

for all t > 8/3/n and V'(^max) > — A, ■(/j(Lmin) < A where 



A 



max 

l<j<jr<n 



Lij - - 'p{Lik)f{Ljk] 



Hence there exists eo > 0, c > such that whenever A < eo we have Lmin > — cA and 
either -Lmax <u* + cA or l-Lmax — v*\ < c^/~K. Define 



C/ = {^max< (n*+t;*)/2}. 
Then again using the idea used in Lemma [TT] one can easily show that 



(43) 



E(l 



u • — u* 



< 



n 



1/2 



for all i < j. 



We will show that P(?7'^) < (logn)^/n and it will imply that 

E{\Li, -u*\)< E{lu ■ \Lij - u*\) + F{U') < ^^^^ for all t < j. 



n 



Then the rest of the assertions follow using the steps in the proof of Theorem 23 



Hence let us concentrate on the event V^. It is enough to restrict to the event C/'^n{|Linax — 
v*\ < c\/A} n {-Lmin > u* — cA}. Here the rough idea is that, a large fraction of Lj/s has to 
be near v* in order to make Lmax — v* ■ Suppose Lmax = -^^ioio • Define the set 

A = {k : Lif^k < Lmax - Si} 

where 5i will be chosen later such that 6i + c-v/A < v* — u* . Note that (piuY < max{ii,n*} 
for all u and by assumption |-Lmax ~ v*\ < c\/A. Thus (p{Lij) < V-^max for all i,j and 
^{Liok) < V -^^max — ^1 max(l ~ ^i/'^) foi' k G A. Thus we have 

Lmax = Li^^jQ < A + - ^ (p{Li^ik)v{Ljok) < A + Lmax — 



2n 
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which clearly implies that ^ < Similarly define the set Aj = {k : Ljj. < Lmax — ^2} 

where 82 will be chosen later such that ^2 + c\/A < v* — u* . Using same idea as before, for 
j ^ we have 



-^max — ^1 ^ Li^j < A + Lj] 



\AA52 \AA 2(A + 5i , 
L^?i_^ or < — ^ := Mfsay . 



Choose 82 = Al/^5l = A3/5. Then we have 

i2 ^ n\A\ + nM + n^Sl 
2 



T M < 

-^max I ^ 



di O2 

Thus, by symmetry and Holders' inequality, we have 



n^A n2(A + 5i) n^5'^ , a,n/. 
— ^ \ ^ 7 H < 4n^A^/^. 



*|2- 



10 



K(logn)V5 9^,0 
ni/5 > ■ 



< 



for some constant K. Now using lemma 21 and equation ( [44^ we have, 



E 



|^(X) 



and E 



' n 



2 I 
6 ' 



< 



< 



Cn9/5(logn)i/5 
P(f/c)i/io 

Cn^/s (log n) 
P(C7'=)Vio ■ 



If P(C7'^) > (logn)'^/n, from inequality (|45[) we have 



P |^(X) 



2 



andpf|^-!M^|>irn-A0 



for some large constant K depending on /3, h. Now define the set 



S = <^ a; G {0, 1}" : 



*\3 



n 



6 



E{x) 



Using the same idea used in the proof of lemma 23 one can again show that 



logiZ^Fm)-fi^{v*)) 



< 



n 



K 

1/10 



(44) 



(45) 



for some constcint K depending on h. Tlie crucial fact is that IP(j^-^^max 

(Z) > (n* +v*)/2}r\ 

{Z € B}) is bounded away from zero when Z = {{Zij))i^j ~ G(n, (/?(f *)). Thus we have 



logZn-fMv*)) 



< 



n 



K 

1/10 ■ 



But this leads to a contradiction, since by equation p2|) we have 



\0gZn{P,h)>f{^{u*)) 



K 



n 



and f(ip{u*)) > f{ip{v*)). Thus we have P(C/'^) < (logn)^/n and we are done. 
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3.5. Proof of Theorem |15[ The proof is almost an exact copy of the proof of Theorem [8| 
Recall the definition of L 



(n - 2)„^ 



for i < j. 



(46) 



In fact we can write Lij explicitly horrible sum 



1 



E 



E 

h<t2<-<t^p-2 {a,b)eE{F) 
t;e[n.]\{ij} for all I 



E n 



X 



{k,l)eE{F) 



where the sum is over all one-one onto map vr from V{F) = [vp] to {a,b,ti, . . . ,^^^^,^2} 
where {7r(a), 7r(6)} = Now we briefly state the main steps. First we have E(Xjj | 



Moreover, using lemma 



20 



it is easy to see that | 
< C/3/n for every distinct pairs (^1,^2)) • • • , (^2fc-i;*2fe) 



rest) — 
where C is an uni- 



1 Xi2j~ii2j 



rest) = (p{Lij) 

Uj=i ^{Li^j 
versal constant. 

Now, fix 1 < z < J < n. Given a configuration X, construct another one X' in the following 
way. Choose vp — 2 distinct points uniformly at random without replacement from [n\ \ {i, j}. 
Replace the coordinates in X corresponding to the edges in the complete subgraph formed by 
the chosen points including i, j (except that we do not change Xij) by values drawn from the 
conditional distribution given the rest of the edges. Call the new configuration X'. Define the 
antisymmetric function F(X,X') := (n - 2)„^_2(iii - L[,j). and /(X) := E(F(X, X') | X). 
Using the same idea as before and Theorem [T] we have 

P {\Uj - 9ij\ >t)< exp(-cntV(l + P)) 



(47) 



where c is an absolute constant and gij is obtained from Lij by replacing Xm by ip{Lki) for 
all A; < L Note that there is a slight difference with the calculation in the triangle case, since 
we have to consider collections of edges where some are modified and some are not. But their 
contribution will be of the order of n^^. Also the conditions on ip arises in the following way, 
if all the -Ljj's are constant, say equal to u, then from the "mean-field equations" for Ljj's we 
must have 



ap{n 



2ep 
ap 



ip{u 



)vF-2 



E E E^X ' 

ti<t2<--<t^p-2 {a,b)eE{F) TT 
tie[n]\{i,j} for all I 



\e_p-l 



The next step is to show that under the conditions on ip, we have E \ Li 



for all i < j where K = K{I3, h) is a constant depending only on /3, h. The crucial fact is that 
the behavior of the function p>{u)^ — au where a > is a positive constant and A; > 2 is a 
fixed integer, is same as the behavior of the function ip{u) 



Now it will follow (using the same proof used for lemma |2l[) that 

E ^(X) - < Cn^l^ 

and E iV(X) - ' < Cn^^-^l'' 



ap 
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where C is a constant depending only on /3, h. The rest of the proof fohows using the arguments 
used in the proof of Theorem [Sl 



Proof of Theorem 14, Using the method of proof for the triangle case and the result from 
Theorem 15 the proof follows easily. □ 



Proof of Lemma\l6\ The proof is same as the proof of lemma 12 except for the constants. □ 



3.6. Proof of Theorem 17, Suppose cr is drawn from the Gibbs distribution ^(s^h- We 
construct cr' by taking one step in the heat-bath Glauber dynamics as follows: Choose a 
position / uniformly at random from 17, and replace the I-th coordinate of cr by an element 
drawn from the conditional distribution of the aj given the rest. It is easy to see that (cr, cr') 
is an exchangeable pair. Let 

F{a, a') := \n\ (m(cr) - m(cr')) = aj - a'j 

be an antisymmetric function in cr,a'. Since the Hamiltonian is a simple explicit function, 
one can easily calculate the conditional distribution of the spin of the particle at position x 
given the spins of the rest. In fact we have W,{ax\{o'y,y ^ x}]) = ta,nh.{2 13 dmx{cr)) where 
mx{cr) := ^ YlyeN^ is the average spin of the neighbors of x for a; e Q,. Now using Fourier- 
Walsh expansion we can write the function tanh(2/3dma;((T)) as sums of products of spins in 
the following way. We have 

2d 

tanh(2d/3m:,.(cr)) = ^afc(/3) ^ as (48) 

fc=0 \S\=k,SQN^ 



where 



^ / 2d \ k 

2d Yl tanh /3^f7, mcTj 
<Te{-i,+i}2d V «=i / i=i 



- ^ 2^ tanh \P}^c7,]\\aj (49) 

for k = 0,1, . . . , 2d. It is easy to see that ak{f3) = if /c is even and ak{P) is a rational function 
of tanh(2/3) if k is odd. Note that the dependence of on d is not stated explicitly. Thus 
using equation (48) and the definitions in ( |20[ ) we have 



/(a) = E[F(<T, a')\cT] = J] E[ax - a'x\a] 



m{a) — — tanh(2^dmx(cr)) 



= (1 - 2daimm{a) - J] ( a2fc+i(/3)r2fc+i(tT). 

Define 0fe(/3) := «2fc+i(/3) for fc = 0, 1, . . . , d — 1. Note that we can explicitly calculate 

the value of 6*0 (/3) as follows, 

/ 2d \2d 2 ( 2d \ 

w) = ^d E t^-h /3 E ^0 E = E d + J 

cr6{-l,+l}2d V i=l / 1=1 k=l ^ ^ 
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Now we have \F{a,a')\ < 2 and 

\f{cT) - f{a')\ < f |1 - + ^{2k + i)em] 
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fc=l 



\n 



m 



for all values of cr^a'. Hence the condition of Theorem [T] is satisfied with B = Q,C 
2\n\~^h{j3). So by part (ii) of Theorem [l] we have 



P \ M 



k=l 



>t \ < 2 exp 



— 1 
mm) 



for all t > 0. Obviously ^o(") is a strictly increasing function of /3. Also we have Oq{Q) = and 

9o(oo) := hm 0o(/3) 



A;=l ^ ^ 



For d = 1 we have 9q{oo) = 1 and for d >2 we have 



9o(oo) > 



1 



1 

4d-l 



. fc=l 



i2d 



2d 
d + k 

2d 

d 



2d 
d + 1 



2d 
d + 1 



8 /2d + 1 



22d+i \d + l 



and from the fact that EfeS-i (^°'fc^^) ^ 22°'+i we have 



1 /2ci+l 



<^<iford>2. 



22d+i ^ + 1 y - 4((^ + 1) - 3 

Hence for d > 2 we have ^o(oo) > 1 and there exists /3i G (0, oo), depending on d, such that 
1 - 6*0 > for /3 < /3i and 1 - 9o{l3) < for /3 > This completes the proof. 



3.7. Proof of Proposition 19 . The proof is almost same as the proof of proposition 17 
Define cr,a' as before. Define the antisymmetric function F{a,a') as follows 

F{(T,a') : = +tanh(/i)tanh(2/3dm/(cr)))(m(cr) - m{a')) 

= (1 + tanh(/i) tanh(2/3(im/(cr)))(cr/ - a'j). 

Recall that mx{cr) := '^yi^N^ '^y average spin of the neighbors of x for x G ft. Now 

under /x^^/i we have 

'E{ax\{cry,y / x}) = tanh(2/3dm^(cr) + h) 

tanh(/i) + tanh(2/3(imi:(cr)) 



1 + tanh(/i) tanh(2/3dm2^(<T)) 



Thus we have 

fia)=E{FicT,a')\cT) 



^ ^(1 + tanh(/i) tanh(2/3dm^(cr))) E(o-a: - a'^ 

) — tanh(/i) + — - ^^(tanh(/i)(Tx> — 1) tanh(2/3dm^.(cr)). 



m(cr 
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After some simplifications and using the definitions of the functions r, s we have 

d-l 

f{a) = (1 - 0o(/3))m(^) -Y,0mr2k+i{cT) 

k=l 

/ d-l \ 



- tanh(/i) 1 - ^fc(/3)s2fc+i(o-) I . 

Now for all values of <t, cr' we have 



k=0 



|/(«t)-/(OI< |^K/3)(l + tanh|/i|) 



and the proof onwards is exactly as in the proof of proposition 17 



3.8. Proof of Theorem [2| Assume that ^(0) > 0. We will handle the case "0(0) = later. 
Note that condition ([T]) implies that x'^/ip{x) is a nondecreasing function for a; > 0. Define 
the function 

ip(x) := ^ and 7(x) := 2 — ^ ^ for x / 
yj[x) ip{x) 

and ip{0) = 0,7(0) = 2. Clearly we have 2 - a < 7(x) < 2 for all x G E. Now, 
limsupx^Q ip{x) < limx>-->o+ = = (/?(0) as a < 2. Also ip{x) is differentiable 
in E \ {0} with 

(^'(x) = > for X / 0. (50) 

ip[x) 

Hence if is absolutely continuous in E and is increasing for x > 0. 

Define Y = f{X). First we will prove that all moments of '^{Y) are finite. Next we will 
estimate the moments which will in turn show that ip{Y)^^'^ has finite exponential moment in 
E. Finally using Chebyshev's inequality we will prove the tail probability. 

By monotonicity of ■0 in [0, 00) and definition of a, we have 

xib^ {x\ 

< — tV < " for ah X > 0. (51) 

il){x) 



It also follows from (50) that < {\og(p{x))' < 2/x for x > and integrating we have (p{x) < 
(/?(l)x^ for all X > 1. Hence ip{x) = (p{\x\) < (/?(l)(H-x^) for all x G E and this, combined with 
our assumption that E(|/(X)|'^) < 00 for all k > 1, implies that 'E{(p{Y)'^) < 00 for all fc > 1. 
Define 

"5(2 -a) + (^ + 1/4" 

Fix an integer k > 13 and define 



(2 -a 



i2 



> 3. 



^'2ik 1 ^'2k 2 

= TkT^ ^(^) = TkT^ ^^"^ X e E. 
tp'^yx) 'ip'^yx) 

Clearly E(|y5((y)|) < 00. Note that g, h are continuously differentiable in E as > 3. More- 
over, for X G Ewe have, |5''(x)| = h{x) |A;7(x) — 1| < (2A;— l)/i(x), h'{x) = {kj{x) — 2) x'^^~^ / ijj^ {x) 
and 

h"{x) = [{k^{x) - 2) {k^{x) - 3) + A:x7'(x)] 
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We also have 



X7'(x) 



xiIj'{x) 



1 



xip'{x)\ xij)"{x) 



ip{x) 



ip{x) 



> -1/4-5 



il^ix) 

for X G in. Now k > (3 implies that 

ikj{x) - 2) {kj{x) - 3) + kx-f'{x) > {k{2 - a) - 2) {k{2 - a) - 3) - k{5 + 1/4) > 

for all X. Thus h"(x) > for all x and h is convex in K,. 

Let X',F{X,X') be as given in the hypothesis. Define Y' = f{X'). Recall that iX,X') is 
an exchangeable pair and so is (Y,Y'). Using the fact that f{X) = 'E{F{X,X')\X) almost 
surely, exchangeability of {X, X') and antisymmetry of F, we have 

E{Yg{Y)) = E{f{X)g{Y)) = ^^iX, X')g{Y)) 

1 



E{F{X,X'){g{Y)-g{Y'))). 



(52) 



Now, for any x < y we have 



g{x) - g{y) 






x-y 




I 



g'itx + (1 - t)y) dt 



< {2k - 1) / h{tx + (1 - t)y)dt 



and convexity of h implies that 



h{tx + {I - t)y)dt < / {th{x) + {l-t)h{y))dt = {h{x) + h{y))/2. 
Jo 

HYgiY)) < ^—^my - Y')FiX,X')\{h{Y) + hiY'))) 



Hence, from equation (52) we have 

2k - 1 



{2k - 1) E{A{X)h{Y)) < {2k - 1) E(v^(y)/i(y)) 



(53) 



where the equality follows by definition of A(X) and exchangeability of iY, Y'). Thus for any 

(54) 



k > P we have, from (53), 



E{^{Yf)<{2k-l)ny^{Y)'-'). 
Using induction for k > f3 we have 

/ (2/c)!2/^/3! 



nv{Yr) < 



E{ip{Yf) for k>f3. 



2''k\{2(3)l 

Also Holder's inequality apphed to (Isl) for A; = /3 imphes that E(v3(y)^) < (2/3 - 1)^. Thus 



we have 



1 (2/3- 1)^= if < A; < /3. 



(55) 



Note that we have < + e ^ = 2 X]fc>o /{2k)\ for all x G R. Combining everything we 
finally have 



D2fc 



E(exp(0^(y) 1/2)) < 2 ^ — n^{Yf) 



< 



k=0 

2^+^/31 

w 



(2A.)! 

fc=S ■ A:=0 



2(2/3 - 1)^=^2^ 



< C^exp{e'^/2) 
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for all > where the constant is given by 

2(2/3 - 1)'=2'=A;! 



Cr := max 



< /c < /3 



(2A;)! 

Here we used the fact that {2k)l > 2'^^~^k^? /k. Now recall that (/? is an increasing function in 
[0, oo). Thus using Chebyshev's inequality for exp(0(^(x)^/^) with 6 = ip{tY/'^ we have 

P(|/(X)| > t) < C7^e-^^W'''+^'/2 ^ 

Now suppose that ■0(0) = 0- Foi" e > fixed, define i^e^x) = ip{x) + e. Clearly we have 
A(X) < Tps{f{X)) a.s. and ^pe satisfies all the other properties of ip including 

xijj'^{x) / iIji;{x) = x0£(x)/0(x) • ip{x)/{ip{x) + e) < a 
and xip"{x)/ipi;{x) = xip'^{x)/7p{x) ■ il){x) / {'4){x) + e) < 5 

for all X > 0. Hence all the above results hold for -(/^e and v'e(x) = x^ j^^ix). Now f (/? as 
e J, 0. Letting e J, we have the result. 

When -0 is once differentiable with a < 2, it is easy to see that the function h is nonde- 
creasing (need not be convex) in [0,oo) for k > f3 := [2/(2 — a)]. In that case we have 

/ h{tx + (1 - t)y)dy < max h{z) < h{x) + h{y) 
Jo z<^[x,y] 

for X < y. Hence we have the recursion 

E((/j(y)^) < 2(2A;-l)E((p(y)^-i) (56) 

for k> p. Using the same proof as before it then follows that 

P(|/(^)| >t)< Ce-^W/^ 

where C depends only on a. □ 
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